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Abstract. We improve and subsume the conditions of Johansson and Oberg 
[18) and Berbee 2 for uniqueness of a g-measure, i.e., a stationary distribution 
for chains with complete connections. In addition, we prove that these unique 
g-measures have Bernoulli natural extensions. In particular, we obtain a unique 
^-measure that has the Bernoulli property for the full shift on finitely many states 
under any one of the following additional assumptions. 
(1) 



(2) For any fixed e > 0, 



(var n logp) 2 < oo, 



y ' g~( 2+ e J( var l lQ gg^ hvar„logg) _ ^ 

n=l 

(3) 

var n log g = o I 



in , 

That the measure is Bernoulli in the case of ([1]) is new. In ((2| we have an im- 
proved version of Berbee's condition (concerning uniqueness and Bernoullicity) 
[2J, allowing the variations of log p to be essentially twice as large. Finally, @ is 
an example that our main result is new both for uniqueness and for the Bernoulli 
property. 

We also conclude that we have convergence in the Wasserstein metric of the 
iterates of the adjoint transfer operator to the g-measure. 



1. Introduction 



Let S be a countable set. Let Z+ = {0,1,2,...}, Z = {..., -1, 0, 1, 2 ...}, X = S z , 
X+ = <S* + and X- = S'^ z+ . Any bi-infinite sequence x S X and n £ Z, gives a 
one-sided infinite sequence = (x_ n ,x_ n+ i, . . .) in X + . Moreover, the stochastic 
process {x^} n& z has the Markov property for any distribution of x in A4(X), where 
Ai(X) denotes the Borel probability measures on X, with respect to the product 
topology on X. 
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Let g > be a continuous function on X + such that 

(1.1) ^ 9{xqx) = 1, x G X+. 

A distribution fi G .M(X) of x G X is a g-chain if 

(1.2) ^(V n )|x (n ~ 1} ) = #(V n ) 



for all n > 0. Thus, the process depends on the past according to the ^-function. 
Note that the distribution of a g-chain is uniquely determined by the distribution 
MO ( x (o))-l £ X(x + ) of its "initial" value x^> . 

If (7 depends only on the choice of the new state then we have an i.i.d. process, and 
if g depends on the new state and the previous one, then we have a Markov chain 
on the countable set S. If we have dependence on the k previous states, before 
moving to the new state, we have a /c-chain, and if there is no such restriction on 
the dependence, we have a chain with complete, or infinite, connections. 

In this paper, we will restrict our attention to the case when S is a finite set and g > 0. 
A stationary measure for our process is sometimes called a g- measure, see Keane [22] . 
who introduced this notion in ergodic theory. Important contributions were also 
provided by Ledrappier [23], where in particular it was shown that g- measures are 
equilibrium states, and Walters [26], where the theory of ^-measures was connected 
with the transfer operator theory for general potentials. The theory has also had 
a long, but slightly different appearance in the probability theory of chains with 
complete connections, see e.g. Doeblin and Fortet (1937) [9j, where it was proved 
that uniqueness of g- measures follows from summable variations, and the works 
by Iosifescu and co-authors, for instance that with Theoderescu in [16J and with 
Grigorescu [15]. The theory is also connected to that of iterated function systems, 
or iterated random functions; see Diaconis and Freedman [8] and the references 
therein. A recent contribution by Iosifescu is [14] , We have not attempted to give a 
complete survey of the literature, but rather to point the reader in some important 
main directions of the different appearances of the problems we are considering here. 

If T is the left shift map on X + , then a g- measure can alternatively be viewed as T- 
invariant probability measure /i G A4(X + ), with the property that g = dfi/d(n o T). 
Since X + is compact due to the finiteness of S, it follows that there always exists a 
(^-measure. Uniqueness is however not automatic, as was clarified by Bramson and 
Kalikow in [5]. Examples of non-uniqueness have since then been provided in, e.g., 
[4] and P]. 

A useful way of viewing a ^-measure is as a fixed point of the dual C* of the transfer 
operator C, defined pointwise by 

Cf{x) = 9(y)f(y), 

Ty=x 
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where C : C(X + ) — > C(X + ). Hence, a ^-measure can be viewed as a probability 
measure satisfying C*fi = \x. 

If we do not impose the probability assumption (jl.ip . the eigen-measure of the dual 
of the transfer operator is not invariant in general, but may instead look for eigen- 
measure solutions v of C*u = Az/, where A > is the greatest eigenvalue of the 
unrestricted transfer operator C, 

cf(x) = eHy)f(y), 

Ty=x 

where <f> is the potential function, usually belonging to a function space with the 
same regularity conditions as the test functions /. 

In this paper our results only concern the case of probabilistic weight functions, that 
is 4> = log 9) where g satisfies (jl.ip . In [18J, it was proved that there exists a unique 
g-measure if g > and 

oo 

(1.3) ^(var n log£) 2 < oo, 

n=l 

where the nth variation of a function / is defined as 

var n / = sup \f(x) - f(y)\, 

x~ n y 

where x ~ n y means that x and y coincide in the first n coordinates. 

This condition of square summability of variations of the ^-function for the g-chain 
is proven [3] to be sharp, in the sense that for all e > there exists a ^-function such 
that 

oo 

(var n logs0 2+e < 00 > 

n=l 

with more than one ^-measure. This should be compared to an older result of Dyson 
|10j for general potentials <j>, identifying summability of variations as sharp, in the 
sense that we may have multiple eigen-measure solutions of Cu = \v, when 

oo 

J2(var n <f>) 1+e < oo. 
n=l 

In view of this dichotomy in terms of summability of powers of variations, Berbee's 
two results from the late 1980s are intriguing. He proves uniqueness of a ^-measure 
and of an eigen-measure in the general case, when 

oo 

(1.4) J2 e ~ n ~'' ~ r " = °°' 

n=l 



i 



ANDERS JOHANSSON, ANDERS OBERG AND MARK POLLICOTT 



where r n = var n log g or r n = var n (f>, respectively. This allows for the non-summable 
sequence r n = -. In the case of general potentials this is sharp, modulo a con- 
stant factor, see pQ, but obviously not for g- measures, since square summability of 
variations cover sequences r n = -rpi+i , e > 0. 

Since it was shown in [18] that there are sequences that satisfy Berbee's condition 
but not square summability, it becomes interesting in the case of proving uniqueness 
of a g-measure to ask if there is a condition that subsumes in a natural way these 
two uniqueness conditions. We provide conditions for uniqueness that contains both 
square summability of variations and Berbee's condition for a unique ^-measure. 

Our method of proof also allows us to conclude that the unique g- measure is Bernoulli, 
meaning that if we look at the natural extension of the dynamical system, i.e., 

^ (x_ n , X__ n j r \, . . .), 

n > 0, with the ^-measure /i as initial distribution for x^°\ then this stochastic 
process is isomorphic to an i.i.d. process. 

The Bernoulli property was also proved by Berbee, but is new for square summability 
of variations (convergence for the iterates of the transfer operator is known from [19]). 
For instance we prove that we have a unique g-measure that is furthermore Bernoulli 
under the following three special conditions: 



(1) 



(2) For any fixed e > 0, 



^2 (varn log fiO 2 < oo; 



n=l 



^ e -(|-N)(n+-+r„) =oo; 



n=l 



var„ !(.» // - o[ — ) . n — x . 



(3) 



The last example is in a sense the weakest condition we have for a unique Bernoulli g- 
measure. The second is an improvement of Berbee's condition with a constant, owing 
to our method. For other results concerning the Bernoulli property for ^-measures 
and equilibrium states for general potentials, see [28J. 

It would be interesting to investigate whether there is a sharp constant so that we 
have uniquness and perhaps the Bernoulli property for var n log g < . Perhaps the 
< should be replaced by a < and perhaps the constants are different for uniqueness 
and for the Bernoulli property. 

Our method of proof relies on two main ideas. 
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Firstly, we use a forward block coupling, including solving the renewal equation to 
obtain an estimate of the probability of having conflicts between two extensions of a 
g-chain, starting from two different distributions. This argument is then applied to 
a perturbation of one of the extensions to a sequence of ^-functions corresponding 
to a sequence of Bernoulli measures that converges in the d-metric to the unique 
^-measure under investigation. 

Secondly, we use Hellinger integral estimates from |17j to calculate the probability 
of not having a conflict (that is, different entries in a corresponding coordinate) 
in the extensions of two initial distributions when we add a new block of positive 
integer length bi (at a certain height I > 1 in the extension). We show that if these 
probabilities are e~ Pl , the maximal probability of not having a conflict, as defined 
through the total variations distance, then we can approximate pi in such a way that 
it asymptotically includes a square sum of the variations, where the sums are taken 
over the increasing blocks. More precisely, if we define recursively an increasing 
sequence of natural numbers Bi = £>;_i + 6j, I > 1, Bq = 0, we get the estimate 



In the special cases (1) and (3) above, we have found examples of exponential increase 
of bi in I. If bi = 1 for all Z > 1, we obtain Berbee's situation, in which case 
pi < ri = var/logg. However our estimates show that although this is of the right 
order, our method allows one to improve Berbee's result by a constant; essentially, 
the variations are allowed to be twice as big. 

We can now state one version of our main result. 

Theorem 1.1. We obtain a unique g-measure which is Bernoulli, if there is a se- 
quence of positive integers {bi}fl 1 such that, with {ri} defined from {bi} as above, 
limsupn = and 



Pl < (l + o(l))«, 



where 




k=B l _ 1 



Finally, we define 




oo 




1=1 
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2. Preliminaries 

2.1. The Bernoulli property and the d-metric. Let M 9 (X) C M{X) denote 
the set of g-chains corresponding to the (^-function g, i.e. the set of fi such that 

Let Atj.(X) denote the set of 5- measures. 

On .M(A" + ) we have the natural filtration {.Fn} of the Borel u-algebra, where J- n = 
cr(xo, . . . ,x n _i). For a measure v G .A/f(X + ) and a sub cr-algebra B C T, we let 
denote the restriction to 6. 

Recall that coupling (or joining) between two probability distributions \x G .M(X, JF) 
and /U G .M(X, J 7 ) is a probability distribution f G M(l x y,7(g) J) of a pair 
(x, x) ~ X x X such that the marginals are are given by x ~ \i and x ~ fi. For a 
pair of probability measures (//, /t) on the measure space J 7 ), where X = S z 

and J 7 denotes the corresponding product cr-algebra, let 

d(/j,,p,) := inf limsup^{x_ n / x_ ra }, 

y n— >oo 

where the infimum is taken over all couplings v between \x and \x. This corrsponds 
to the d- metric introduced by Ornstein (for a reference, see e.g., [7] or [25]), if we 
take the restriction to the space M.t{X) of shift invariant measures; on M{X) it is 
a pseudo-metric. Notice that in our case, the definition of d uses couplings that are 
not necessarily translation invariant even if the marginals are. In [7], the authors 
define d on Mt{X) by taking the infimum over couplings that are invariant under 
the transformation T x T on X x X. However, the original definition by Ornstein 
does not presuppose translation invariant couplings. 

An invariant measure fx G Mt(X) is Bernoulli if it can be realised by an isomorphism 
with a Bernoulli shift. In other words, there is a bijectively measurable mapping 
<fi : A z — > X such that o T" = T o 0, where T' denote the shift on and such 
that /x = n' o (f)^ 1 where // is a Bernoulli shift, which means that, under //, each 
symbol is chosen independently according to some fixed discrete probability on the 
finite set A. Ornstein proves in [25] that the set B of measures in A4t(X) having 
the Bernoulli property is closed in the topology induced by the d-metric. Many 
classes of (7-functions are well-known to give rise to unique g-measures with the 
Bernoulli property. In particular, if the g-function is determined by a finite number 
of coordinates, i.e., it is the transition probabilities for iV-chains, for some finite N; 
see e.g. [25] or [Tj. We also remind the reader of the results of Walters, see [28] . 

It easy to see that any given g-function g with varjv log g — > as N — > 00 can be 
arbitrarily well approximated by finitely determined ^-functions, e.g. let 5at(x) = 
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g(xo,xi, . . . ,Xffz), for a fixed z G X+, whence 

|| logg N - logg||oo <varArlogfif. 

Let n and /t denote g-chains corresponding to the ^-functions g and g, respectively. 
Our strategy — which is similar to that in used in [7] - - for proving that the g- 
measure \i is Bernoulli, is first to show that, the d-distance between \i and p, can be 
bounded by a function which is continuous in s = || log g — log <7||oo and that fixes 
zero. 

A finite block- structure is a sequence {bi}f£ l of positive integers b\ > 0. We refer 
to the index I as levels. By a block-variation pair, we mean a block-structure {&;} 
in conjunction with a sequence {r{\ of positive real numbers. For a block- variation 
pair {{ri},{bi}) = ({n}f£i> {h}f£i) we define a real number 

^.w):= i+ ^'" n (i r'- n) . 

where for simplicity we have adopted the convention that e _ri Vl ~ 1 = 1 for Z = 1. 
A block-variation function r associates a positive real number r(B,b) to integers 
i? > and b > 0. Given a block-structure {6;} and a block-variation function r, we 
define the corresponding sequence {r{\ by setting 

(2.2) n :=r(&i + &2 + ••• + &,_!,&/). 

In this context, we will denote the pair {{r{\, {b{\) by (r, {bj}). 

Our first lemma establishes a bound on the d-metric between g-chains which is 
continuous in the supremum norm. 

Lemma 2.1. Let g and \i be as above. There is a block-variation function p 9 {B,b), 
such that for any block-variation pair ({r{\,{bi}) satisfying 

(2.3) P \ < n 

we have 

(2.4) I + s •&*},{&,}), 
for all g-chains fi corresponding to a g-functions g with 

|| log£/ - fog<?l|oo = S. 

We say that pairs ({n}, {bi}) satisfying (12. 3p are valid for g. We prove this lemma in 
the next subsection. Note that, for a fixed finite pair {{ri}f£ l , {^}^ 1 ), the quantity 
5({ri}, {bi}) is clearly continuous in {r;} so that in particular 

lim *({ri + a 6,} ) {6 l }) = 5({r I } J {6i})- 
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To see how we can deduce the the Bernoulli property, notice that if 
(2.5) inf 8({n},{bt}) = 0, 

where the infimum is taken over all pairs ({ri}, {bi}) that are valid for g. Then, 
for every e > 0, we can find a block-structure with 5(g, {6^}) < e. By the 

continuity of 5(-, {tf}) we can take a finitely determined (locally constant) ^-function 
g with (/-measure fi such that 

d(n,fi) <5(r + || logs- - log5||oo,{6;}) < 2e, 

say. It follows that the d-distance between the (/-measure [i of g and the set B 
of Bernoulli measures is zero and since B is closed with respect to the d-distance 
[25], we conclude that [i 6 B. Moreover, it is well-known and easy to see that this 
(/-measure corresponding to g must be unique. We collect the conclusions in the 
following Theorem. 

Theorem 2.2. // (12. 5p holds then we have a unique Bernoulli g-measure fj, corre- 
sponding to g. Moreover, ju is attractive in the sense that C* n v converges weakly to 
\x for any initial distribution v € M{X + ). 

We prove the last statement in Section [3l 

2.2. The coupling argument and the proof of Lemma 12.11 In order to obtain 
the bound in ()2.4|) . we will need to construct a coupling between a (/-chain [i and a 
(/-chain ft, by defining the two chains x ~ \i and x ~ /2 on the same probability space 
(£l,J-,F). Assume that s = || logg — log(/||oo. The distributions of x^ and are 
arbitrary. 

The coupling we construct uses a block-structure {bi}, where we, at certain times 
n, extend the two (/-chains with block of symbols of length 6/ until we reach a 
conflict — i.e. a coordinate with different symbols — in the extension. Extending 
the two chains x^ and x^ with a block of length 6/, means specifying a distribution 
of the pair (x^ n+bl \ x {n+b ^) such that x^ 1 ^ has distribution C* bl 5 x{n) and x {n+b ^ 
has distribution C*~ bl 5 x („). We are at level I when we extend with a 6;-block and 
this presupposes, that previously, without conflict, we have extended with blocks at 
levels 0,1,...,/ — 1 of a total length 

Bi-i = b 1 + b 2 + ■■■ + b l _ 1 . 

For (y,y) G X + x X + , define the concordance time as the non-negative integer 

k(v, y) = sup{A; > : y ~ fc y}. 
The event of success (or "no conflict") means that that 

K{x (n+bl \ X (n+bl) ) = K(x (n) , X (n) ) + bt. 
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We always use a maximal coupling between the chains, i.e., a coupling that makes 
the probability of success maximal. 

We show (|2.4p in Lemma l2.lt by defining on the same probability space (£l,J~, IP) 
a Markov chain Y n taking values in Z. Given a block-variation pair ({f/}, {fy}), we 
define an associated Markov chain Y n = Y^f 1 ^ ^ , n > 0, as follows: Let Yo = 0. If 
y n ^ for some I, simply let Y n+ i = Y n + 1, but, if Y n = Bi_\ for some I = 1, . . . , M 
then 



(2.6) Y,, 



n+l 



+ 1 with probability e r < 
,—bi with probability 1 — e~ Tl . 

If y n = Bm we set y n +i = 0, because we want to avoid to have infinite waiting time 
in mean when we later solve the renewal equation. 

By using the Renewal Theorem, we show in section [3] the following. 

Lemma 2.3. Assume that the Markov chain Y n is defined from parameters r and 
{b t } as in ([H]). Then 

limsupP{y n < 0} < S(r,{bi}) 

n— >oo 

where 5 is defined in (|2,lj) . 

We couple the Markov chain Y n = Yn' l+sbl ^'^ bl ^ with the block-extensions such that, 
for all n, 

(2.7) K(iW,iW)>y n . 

Since X- n / X- n precisely when K,(x^ n \x^) = it then becomes clear from 



Lemma 12.31 that 

(2.8) d(p,jj) < limsupP{y n < 0} < 5({r z },{6 z }), 

which is (|2.4p in Lemma 12. 11 

We execute, at time n, a block-extension at level I, precisely when Y n = Bi_\. In 
order to maintain (|2.7p . we should couple the transition of Y n so that Y n = — b[ if the 
extension is unsuccessful; then (|2.7p holds up true to time n + bi even if coordinates 
between — n and — n — bi should disagree. A sufficient and necessary condition for 
the mechanism to work is therefore that the probability that Y n of moves up one 
level, i.e. e _n , is less than the probability that the block-extension is successful. We 
define p 9,9 (Bi-i,b{) as the infimum, over {x^ n \ x^), of the probability of success, 
conditioned on (x^ n \ x^), under the restriction that k(x^ h \ x^) > -B/-i- More 
precisely, we need to show that, the condition that r; is valid implies that 77 + s ■ bi 
is less than p 9,9 {Bi_i,bi). As before, we assume that a maximal coupling is used. 
Notice that, if the extension is executed at level I, we have n(x( n \ x^) >Y n = 

by (Z2D. 
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What remains to complete the proof of Lemma 12.11 is to show that 

p9>9(B,b) > p 9 ^(B,b) + s-b, 
and to give an explicit expression for p 9 := p 9 ' 9 . 

It is well-known that the probability for a successful extension in a maximal coupling 
is given by the total variation metric between the marginals of the extension, see e.g. 
[24]. The success probability is given by 



| A1 ) */=(i-^-*vM))- 

In our situation we can identify the marginals 77 and 77 with the distributions on J-j, 
given by 

for some x^ 71 ' and x^ n ' that satisfy n{x^ n \ x^) > B. Let Mg 9 b denote the set of 
such pairs (77, 77). 

We then define 

(2.9) p 9 ' 9 (B,b) := sup|-log j A d V : (77,77) G M 9 ^ . 
Notice that, since g/g > e~ s , we have 

(2 10) ^ = 9(x)g(Tx)---g(T b ~ 1 5;) > g(x)g(Tx) ■ ■ ■ gjT^x) 

K ' ' dri g(x)g(Tx) ■ ■ ■ g(T b ^x) ~ 6 ' g(x)g(Tx) ■ ■ ■ g{T b -^x) 

and the right hand side equals e~ bs ■ dfj/dr), where 

V : = 

We then obtain from (|2.9p that 

(2.11) fP*{B,b) < p 9 ^ 9 (B,b) + s-b, 
where 

(2.12) ^ = sup j-log J Al\ dr, : (77,77) G A^§f 6 
Since p 9 = p 9,9 , this concludes the proof of Lemma l2.1i □ 

2.3. Estimates using Hellinger integrals. In order to arrive at verifiable condi- 
tions that ensures that inf<5(r, {6/}) = 0, i.e. the assumption (I2.5P in Theorem 12.21 
we estimate the total variation metric using the Hellinger integral. This was done 
in some special cases also in our earlier paper [20]. Define the "Hellinger block- 
variation" h(B,b) = h 9 (B,b) by 

(2.13) h 9 (B,b) = sup {-log #(77,77) : (v,v) £ M 9 g b } 
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where 

i 

dfj\ 2 



is the Hellinger integral of rj and fj. We always have < H < 1. 

The relevant estimates we will need are collected in the following lemma. 

Lemma 2.4. We have the following relations between the block-variations defined 
above 



(2.14) p 9 < — log ( 1 — \J\ — exp(—2h 9 ) ) and, in particular, 



(2.15) p 9 <V2h9 + 2h 9 , 

B+b-l 

(2.16) h 9 {B,b) < h 9 (k,l), 

k=B 

As k — y oo 

(2.17) h 9 (k, 1) = (1 +o(l))i(var fc log 9 ) 2 , 
and as w — ^ 

(2.18) p 9 (B,b)<(l + 0(w))yw 
where 



w 



B+b 

^(var fe log^) 5 

k=B 



A condition ensuring that condition (|2.5p is satisfied is given in the following Theo- 
rem. We say that a block- variation ({?"/}, {fy}) is eventually valid if for some Iq, we 
have 77 > p 9 for I > Iq. 

Theorem 2.5. A sufficent condition for the conclusions of Theorem \2.S\ to hold is 
that there is some infinite eventually valid block variation pair ({n}^ 1 , {fy}^) such 
that lim sup r\ = and 

00 

(2.19) J2 e ~ n ri ~ lb l = °°- 

1=1 

Proof We verify (|2.5p . that is, we show that 

l + E^i^- ri -- r, - 1 (l-e- r ') _ n 



(2.20) inf 
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To see this, note that (1 — e n ) < n. Hence, by the assumption (|2.19l) and since 
ri — > 0, as / — > oo, we have 

inf ^ + TZ 1 b l e^--^r l = Q 

and the conclusion follows. □ 

2.4. Examples. By setting 6/ = 1 and noting that T\ = (1/2 + e) var/ log g eventually 
dominates p\ by (|2.18p . we can deduce the special case ([2]) in the Introduction. We 
now show the results under the hypotheses in the the special cases (UJ and ([3]), by 
verifying that the conditions in Theorem 12.51 are satisfied. 

Note that the following proposition gives a uniqueness result that is not covered by 
earlier results, for instance in 118] . 



Proposition 2.6. We have a unique g-measure with the Bernoulli property if 

var n logg = o( -j= 



'n . 

Proof. Take a real number c > 1. Let Bo = and let B[ = \c l /(c — 1)] for I > 1, so 
that for I > 2 bi = Bi — -B/_i satisfies 

J i 



> 



c7(c-i)-c'-7(c-i) =L<|>i. 



Define n by 

B,-l 

r ? = ( var « lo gf) 2 

n=B;_l 

For / > 2, we have by assumption that (as / — > oo) 

B,-l 

n 2 < (i)- E -4' 



1 



/•c ! /( C -l) 1 

< o(l) • / -dx 

= o(logc) = o((logc) 2 ). 

The integral estimate of the partial sums of the harmonic series follows since > 
c l - l /{c - 1) and B x - 1 < c7(c - 1). 

Since, by (|2.18p . p\ < ri eventually, we can apply Theorem 12.51 We already know 
that ri = o(logc) — > as / — > oo. Moreover, each term in the sum of (|2.19j) can be 
estimated as 

(2.21) bie~ ri ri > exp{l log c - I ■ o(log c)} oo 

which verifies (|2.19p . □ 
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We now show that the uniqueness condition of [18] also gives the Bernoulli property. 
Proposition 2.7. We have a unique g-measure with the Bernoulli property if 

^(var n logg) 2 < oo. 

n 

Proof. First note that if {r{\ is a block-variation relative to blocks {b{\ such that 

r\ + r 2 -\ < oo, 

then it is clear that the conditions in Theorem 12.51 hold for {77} and 
We define the blocks B[ such that Bq = and 

Bi = inf J B > B H : (var n logg) 2 < L/2 1 1 

I n=B J 

where L = Yln=o( vain 1°S5) 2 - Then with 77 defined by 



Bi-X 



^2 ( var n 1 °g5') 2 , 
n=B l _ 1 



we have r/ + i < OyyL/2 l j and {r/} is clearly a summable sequence since it decreases 
geometrically. Moreover, < n eventually by (|2.18p . □ 

3. Remaining proofs 

3.1. Proof of Lemma 12.41 Note that (|2. 18j) is easily deduced from (|2.17p and 
(EH]). 

Proof of (|2.14p and (|2.15p . In order to relate the two variation functions p 9 and h 9 , 
we use the following bound (Proposition V.4.4 in |17l p. 311]) on the total variaton 
metric 



(3.1) drv(v, V) < 2 v / l-^(r?,r/) 2 - 

This relation immediately gives (|2,14p by re-writing the relations in terms of p 9 and 
h 9 . From this, we obtain (I2.15P as a useful approximation by easy calculations. In 
the estimate (|2.15p , the first term y/2 ■ yh 9 is sharp (\/2 is the sharp number) , but 
the second, 2 • h 9 , is not. Slightly lower numbers than 2 are possible. □ 

Proof of ([2.16p . Let (77,77) G We can explicitly write 
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where (x,x) G (X + ,X + ) satisfies k(x,x) > B + b. Taking the conditional rj- 
expectation of conditioned on Tx gives 

H(v,V) = / h{Tx,Tx) — — drj(x) 

J \g{Tx) ■ ■ ■ g{T K 1 x)J 

where we have 

(3.3) h(y,y) = ^ ^/ g{ay) ^ g{ay) . 

Since — log h(Tx,Tx)) < —h 9 (B + 6—1, 1), we obtain the recursive expression 

- log H( V , fj) < h 9 (B + 6 — 1, 1) ■ {-log H(rf,ff)} , 
where (r]',fj') £ -Mb-16-i- This proves (12.16|) . □ 

Proof of (|2.17p . The relation (|2,17p follows from the Arithmetic-Geometric mean 
inequality: Fix (x,x) £ X + x Xq, and assume that g(x) = e s ( x ' x >g(x), say, where 
\5(x,x)\ < vax K ( X)S ) log 5. Then 

(3.4) Vg&VgJx) = \ (g{x) + g(x)) - 5 2 f(5)g(x), 
where / is the continuous and strictly positive function 



/(5) = ^Q(l + e 5 )-e^) 



tending to 1/8 as 5 — > 0. Summing f|3.4[) over y and y such that (y, y) = (aTx, aTx), 
a £ S, gives that 

- log h(Tx, Tx) = - log(l - ^2 5 2 (y, y)f(S(y, y))g{y)) 

y 

= {l + o{l))5 2 f(5), 

where h as in (13.3p . Taking the infimum over (Tx,Tx) such that k(Tx,Tx) > k 
proves PT7]> . □ 

3.2. Proof of Lemma 12.31 We now use renewal theory to show Lemma 12.31 Our 
aim is to prove that 

P(y n < 0) -> as n -> 00. 

The Markov chain {Y n } will return to at random times {So, S\, S%, . . . } where 
So = 0, since Yq = 0. For time n, define the number N n of returns as 

N n = \{k : < k < n,Y k = 0}| = sup{/c : < n}. 

Define the waiting times = Sk — St-i which are independent and identically 
distributed waiting times due to the Markov property of Y n . The waiting time Tn 
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is the length of the "cycle" that Y n currently completes and this cycle Ys N . . . Ys N +1 
has length B[ for some level I. Let L n denote this level, i.e. B^ n = T^ n . 

We now use the renewal equation to analyse 

(3.5) A n = F(Y n <0). 
The expansion 

(3.6) A n = ¥(Y n <0,N n = l)+ P(Y n < 0, N n > 1) 
leads to the renewal equation 

oo 

(3.7) A n = a n + ^ A n —jPj, 

i=i 

where a n = F(Y n <0,N n = 1) and pj = P(Ti = j). 

Let qi = F{L n = l}. Then 

qi = F{L n >l}-¥{L n >l + l} = e-^-'-n-! (i _ e -n) j 

where we use our convention that e ri r, ~ 1 = 1 when / = 1, i.e., q\ = 1 — e _ri . 
Note that 

j = B,, i = l,2,...,M-l 

0, otherwise. 

Since, qi is the probability that, in the first cycle, Y n < for -B/_i < n < B\ = T%, 
we obtain 

'\, n = 
</„ - < q h <n< 5/, Z = 1,2,...,M 

0, otherwise. 

It is well known that the renewal equation (|3.7p has the solution 

oo 

(3.8) A n = y~]u n -jaj, 

where u n = E[iV n ] — E[ iV n _i ] and the theorem in [ITJ P- 362] states that 

lim An = ^ r ~® . ° , 

n^oo n E[Ti] 

provided Ej^=o l a il < °°- I n our case we have T\ < Bm < oo and this condition is 
trivially satisfied. 

The ratio Ej°i/ E [ r i] 

can be transformed to that in (I2.ip . We have 



M 



M 



j=0 



;=1 



i=l 
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and 

M-l M-l 

E[Ti]= + E ®)^, 

2=1 Z=l 

where the last term is due to the fact that we let Y n+ \ = whenever Y n = Bm, 
recall the definition of pj above. Since B\ = b\ + 62 + • • • + fy, E[Tj ] equals 

M-l 

E (e- ri --^- 1 - e" ri -- r ')(6i +••• + &,)+ e -n--r W _a + 6m) 

z=i 

M 

= E^ e_ri "'"" n " 1 ' 

which is the denominator in ()2. 1 1) . □ 



3.3. Proof of the last statement in Theorem 12.21 We now prove the remaining 
statement in Theorem 12.21 That ()2.5p implies that L* n \J converges weakly to (the 
necessarily unique) (/-measure in M. 9 T for any initial distribution fi' 6 M(X + ). 

In fact, we prove convergence in the Wasserstein metric. Given an underlying 
(pseudo-) metric d on the space Y, the corresponding Wasserstein (pseudo-) metric 
dw between probability measures jl € M.{Y) is defined as 

d w (n, p) := inf £ A 5)] , 

A 

where the infimum is taken over all couplings X £ M(Y x Y) of /i and /L On the 
space we consider the underlying metric d(x, x) = 2~ fi ( x ' x ^ and the corresponding 
Wasserstein metric dw- 

We already know that the condition (|2.5|) in Lemma 12.21 implies that d-distance 
between any pair of (7-chains is zero. In other words 

(3.9) inf lim E, 

v n— >oo 

where v £ M(X x X) signifies couplings of the two arbitrary g-chains. We shall 
show that (|3.9|) implies that 

(3.10) lim sup d w (£*>, C* n fi) = 0. 

n— >oo 

Since dw metrizes the weak topology, (|3. 10|) is equivalent to stating that g has a 
unique attractive g-measure, i.e. is a for any /i, {£* n fi} converges weakly to a unique 
g- measure as n — > 00. 

The statement (|3,10p follows readily from (|3.9p : Let N > be fixed but arbitrary. 
A coupling 1/ £ M(I x I) of the ^-chains with initial distributions \x and p, also 
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gives a coupling A = v o (x^) 1 ® v o (x( n >) 1 of C n [i and C* n jl. Since 

N-l 

d(x,x) < 2~ N + l K < N (x,x) <2~ n +Y, l K =o(T n x,T n x) 

n=0 

it therefore follows from (13.91) that 



limsup d w (C* n f^,C* n fi) < 2~ N + limsup inf E, 



N-l 

l K =o(T fc x (n) ,T fc x (n) 

n=0 

< 2~ N + N-0. 



Since N was arbitrary, this concludes the proof. □ 
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